CDS

Accession Number TCMCG019C25749
gbkey CDS
Protein Id XP_022956529.1
Location complement(join(1934469..1935612,1936130..1936698,1937100..1937861,1938084..1938275,1938400..1938589,1938734..1938828,1939055..1939098,1939469..1939733))
Gene LOC111458242
GeneID 111458242
Organism Cucurbita moschata
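
The Location field above uses the feature-table location syntax: join(...) lists the exon segments as 1-based inclusive coordinates, and complement(...) indicates the feature lies on the reverse strand. As a minimal sketch (the names LOCATION, parse_segments and spliced_length are illustrative and not part of the record), the spliced CDS length can be recomputed from the segments and compared with the stated protein length:

```python
import re

# Location string copied verbatim from the record's "Location" field.
LOCATION = (
    "complement(join(1934469..1935612,1936130..1936698,1937100..1937861,"
    "1938084..1938275,1938400..1938589,1938734..1938828,1939055..1939098,"
    "1939469..1939733))"
)


def parse_segments(location):
    """Return (start, end) pairs (1-based, inclusive) in the order written."""
    return [(int(a), int(b)) for a, b in re.findall(r"(\d+)\.\.(\d+)", location)]


def spliced_length(location):
    """Sum the lengths of all joined segments."""
    return sum(end - start + 1 for start, end in parse_segments(location))


segments = parse_segments(LOCATION)
total = spliced_length(LOCATION)

print("segments:", len(segments))                       # 8
print("reverse strand:", LOCATION.startswith("complement("))
print("spliced length (nt):", total)                    # 3261
print("codons:", total // 3, "remainder:", total % 3)   # 1087 codons, remainder 0
```

The 3261 nt correspond to 1087 codons, that is, 1086 amino acids plus one stop codon, which agrees with the Length field in the Protein block.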

Protein

Length 1086aa
Molecule type protein
Topology linear
Data_file_division PLN
dblink BioProject:PRJNA418582
db_source XM_023100761.1
Definition nuclear pore complex protein NUP1-like isoform X2 [Cucurbita moschata]

EGGNOG-MAPPER Annotation

COG_category UY
Description Nuclear pore complex protein
KEGG_TC 1.I.1
KEGG_Module M00427
KEGG_Reaction -
KEGG_rclass -
BRITE ko00000, ko00001, ko00002, ko03019
KEGG_ko ko:K14317
EC -
KEGG_Pathway ko03013, ko05169, map03013, map05169
GOs -

Sequence

CDS:  
ATGGCGACTGCAAGGGAACGGAAAAGCCGGGAAGAAGAAGGGTTGAGAACGGCGGGGAAGTTTGCAGATAAAAGATTCTTTAGGAAACCGCCGAAAAAACCTTACGATCGGCCGCCGACTACCCTAAGAACATCGGGAAACAATTCGTGGATCTTGAAGCTCGTTGATCCGGCTCAAAGGCTCATTTCCTCTGGTTCTCAGATGCTTTTTTCCTCCGTGTTCCGAAATTTCCCTCACCGTTTACCGTCTCGTACTTCGTCTCCAGAATCAAGCCAGTCAAGAAGGGATGACAAGAAGGCCGATGTAACGGTAGCAGCAGCTAATGTAGGTGATAATCAGAATAGAGCTGATCGATTTGTGATGGTGGAACTTGAAAAAGCTATGAAGCAAAAGACCTTCACCAGGTCTGAGATTGATCATTTGACGGCTCTAATGCATTCAAAAAATGTTGATTTACCTGATGTGAATGAGGAGAAAAGGGTTAAGTTTATCTCTTCTATTCCAGAATCTAACAGGAATGAGTTTAAAAAAATACCAAATTCAGAAGTAAGGATGTGCAGGCAATCGTTCCCAACTCCCATCTTGAGTTCAAGTGTCCTTGATGAAGATATTTCTTCACCTGCAGAGATAGCTAGAGCATACATGGGGAGTAGGCAGCCAAAAATTTGTCCTTCAATGCCATCTTTGCGAGCACAAGGACTTGGGGAAAATTCTGCTCGTCCAACTAGTACATCATTCTCTTCAAAATCAACAGATATGTTGCTTGTGCCATCATCTACTAATCAGGGTTTGAAACGTAGGAGTTCATTTTTTGATAATCACATTGGACCCAATGTTCCTCTGCGCAGAATTGGACAAAAACCTAACATTCATCTCTCGAAGGGATCAAGCTTACCCGTTTCTACTAGACCTATTTCTGTTCCTGTAGATAGACTTAGTTTTGACGCTTCTCAGAGCTCCAAATTTGGGAAAGTTCATAATTTTCCATCTTCCATTTGGAACTCACAATTGTCTCTTAAACCCAAGAAAAATTCTACAAGAAAGTTTATTATGAACGTGGAGAGTGATAACATTCGTGGTGCAGGCAGCAGCTCTATTTATACTCCTTCAAGGTCTTACAAGATGGCTTCTAAGATATTGGAGCAGCTCGATAAGTTGACCCCTCCAAAGGAGAAAGTTAAACGACTTCCTGTTGGGGAAATATCTCCCCCTAAGCTGTCACCATTCACAGTAGATGGGCATCTCAAAATTGTGAAGGATGTGGACTTACCCAGAGATGAAGAACTTGTTCATGACAACAAGCAGTCAATTAGTTTGCATGGCGTTCCATATCATGACAACCAAGAAAACACTTCACAAAATAAAGAGAAGCTGGAAAATATGAAACCATCGGATCCTCATCATAGATGTGCTCTACTGAAGGACTCAGGGTCCATAGGTTCAAGTAAGGATTCCATGATTGATCTTGGAGTGCCTGCGCCTGCTGTGGTGAAATCTATTATTCAGCCCCCAAAAAACAAACTGGCGTTTCAGATGTGGCCTGACAAGGATCGTGTAGACCAGGATGAAAGTTCTCCTGATAGAGTTGCACCTGCTACCGCGGAGGATAGGGAAGGTGACATTTCTTTGGCTGTGAGACAAACAACTGCTAATGAGACTCTAGCACCATCAAAGCCACAAACTGCATCTGAAGTGATAGTGGGTTCTCCTCTCAACAGAAGTTCTGATTTGAAAACTTCTGAAGGTAGCGTTCATGATGATATGGATACCAGTTTTACGTTCCAAATTGCACCTTCACAACCAGAAACTATTGATTCTGCACCCACCAATTCTTTTGGAAACAATGATCTTCCAGAAAAGAAGCGAATTGATTCTCCAGTTTTTAGCTTTGGAAATAATGTCTCTCCACGAAAGCAGCCAAACGCTAGTTCTACTGCATTTGATGTTGGGAATAAGGATGCTTCTCGAACAGAATTATGTGCTGCTCCCGAAAATGGCAA
TGGAGCTCCATTCCCATACACGCAGTGGAATCCAGCTTCTTCATATTCAGATGTTCAAGGATCAGTGTATTTAAACGCCGTTGCATCTTCAAACCATAAGCTAGATTGCTCTTGGGGGACTTGCAATGATGCATTCTCATCTTCTGCCTCCATATCAGCTGGACTTGCGGTCTCATTTTGCTCGACTGCTAGATATCAAAGTCTAAATAATGGCCTTTCCATTTCATGTCCATCTCAATACTCATCGTGCAGTCTGCTAACTCCATCTATGGGGCAAAGTTCATCCAGATATATCTTCCTCTCAGCCAAATGTGCCAGCAACGATGCTAATATAACCACCAATGGCAAGCACCCGTCAACCACAAATGTGATAACTTCATCTGCTCCATCGGCTATGGGCTTAGGAACTCATGAAGACAAGATCAAGCAGGATGCAAGCCTGCACATTGCGAATAACACTTATTTCAGTAGCATATCTACACCAGCAAATTCTCACTATAATATGTTCAGCTTCAATCCTGGAGCAACGCCTTCGTTTGTGAATAATCATCAGTTGAGTACACCTACTGTTAGCAGTGCACCTGAGCTTAGTGCTCAGGGAGCTTCTGCTGGAAAGGAATTTACAGCTAATGCGGAACAAACTTCAATCCTTATGGGATCATTCATGTCACATGCATCATCAGCGATGGCTGGAAAAGCATCCATCTCTTCTGGCATTTCTTTTGGTTGCTCATCTCCTGCTTCTGAACTGTTTCATTCAGGAAGCAGGCCATCGGAATTTCCCATCACTGGGTTTACTTGTGCCCCAGCAACTTCAACCCATTTTTCTACTCCTAGGACACATCTTGGGTTCGAGTCATTTACAGGGGCGTCTTTCAGTTCAATATGTTCTACAACCTCAGCAGCAGCAATAGCATGTTCCTCATCGAAGACTGTTTCAAGTAATTCTCATCCCACAGTTGCTTTTAGAGTTTCTACAGGTAACAATGACTGTGAAGATCAGGGTACCTCCAAGGACAATGTTCCAATTTTCAGTCAAAAGCCAGTCCCACCCCCTTCATCAGGATTCTCTTTTGGTCAAGCCACCTCTGAATCAAATCCCTTTCTAGTTCAAAAGCAGCAGACATTGGCTAAACCCCAAAATTCTTCTCCATATATTGCTCATTCTAGCAGCTTAGAAGCTAGAGGCAGCTTCCCCTTGAGTGCTGGCGGCGGTAACAAGGCTAGCCGGAGACTTGTGAAGGTCAAACGAAAGAAATAA
Protein:  
MATARERKSREEEGLRTAGKFADKRFFRKPPKKPYDRPPTTLRTSGNNSWILKLVDPAQRLISSGSQMLFSSVFRNFPHRLPSRTSSPESSQSRRDDKKADVTVAAANVGDNQNRADRFVMVELEKAMKQKTFTRSEIDHLTALMHSKNVDLPDVNEEKRVKFISSIPESNRNEFKKIPNSEVRMCRQSFPTPILSSSVLDEDISSPAEIARAYMGSRQPKICPSMPSLRAQGLGENSARPTSTSFSSKSTDMLLVPSSTNQGLKRRSSFFDNHIGPNVPLRRIGQKPNIHLSKGSSLPVSTRPISVPVDRLSFDASQSSKFGKVHNFPSSIWNSQLSLKPKKNSTRKFIMNVESDNIRGAGSSSIYTPSRSYKMASKILEQLDKLTPPKEKVKRLPVGEISPPKLSPFTVDGHLKIVKDVDLPRDEELVHDNKQSISLHGVPYHDNQENTSQNKEKLENMKPSDPHHRCALLKDSGSIGSSKDSMIDLGVPAPAVVKSIIQPPKNKLAFQMWPDKDRVDQDESSPDRVAPATAEDREGDISLAVRQTTANETLAPSKPQTASEVIVGSPLNRSSDLKTSEGSVHDDMDTSFTFQIAPSQPETIDSAPTNSFGNNDLPEKKRIDSPVFSFGNNVSPRKQPNASSTAFDVGNKDASRTELCAAPENGNGAPFPYTQWNPASSYSDVQGSVYLNAVASSNHKLDCSWGTCNDAFSSSASISAGLAVSFCSTARYQSLNNGLSISCPSQYSSCSLLTPSMGQSSSRYIFLSAKCASNDANITTNGKHPSTTNVITSSAPSAMGLGTHEDKIKQDASLHIANNTYFSSISTPANSHYNMFSFNPGATPSFVNNHQLSTPTVSSAPELSAQGASAGKEFTANAEQTSILMGSFMSHASSAMAGKASISSGISFGCSSPASELFHSGSRPSEFPITGFTCAPATSTHFSTPRTHLGFESFTGASFSSICSTTSAAAIACSSSKTVSSNSHPTVAFRVSTGNNDCEDQGTSKDNVPIFSQKPVPPPSSGFSFGQATSESNPFLVQKQQTLAKPQNSSPYIAHSSSLEARGSFPLSAGGGNKASRRLVKVKRKK
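
The protein sequence above is the conceptual translation of the CDS. As a rough sketch of how the two can be cross-checked, assuming the standard genetic code (the record does not state a translation table), the snippet below translates the first and last few codons of the CDS. The names CODON_TABLE, translate, CDS_START and CDS_END are illustrative; the two fragments are copied from the start and end of the CDS shown above.

```python
from itertools import product

# Standard genetic code, laid out in the conventional T, C, A, G order.
# Assumption: the standard code applies to this nuclear gene.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds):
    """Translate a CDS codon by codon, stopping at the first stop codon."""
    # Strip whitespace so a sequence wrapped over several lines still works.
    cds = "".join(cds.split()).upper()
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")  # "X" for unrecognized codons
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


CDS_START = "ATGGCGACTGCAAGGGAACGGAAAAGCCGG"  # first 10 codons of the CDS
CDS_END = "GTCAAACGAAAGAAATAA"                # last 6 codons, including the stop

print(translate(CDS_START))  # MATARERKSR
print(translate(CDS_END))    # VKRKK
```

Both results match the beginning (MATARERKSR) and the end (VKRKK) of the protein sequence. Passing the full CDS to translate would allow a complete comparison.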